


GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:         July 28, 2005, 10:51:32 ; Search time 163 Seconds
                (without alignments)
                28.473 Million cell updates/sec

Title:          US-10-618-336A-6
Perfect score:  70
Sequence:       1 MCPTDMPASLCM 12

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       2105692 seqs, 386760381 residues

Total number of hits satisfying chosen parameters:      2105692

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
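The throughput figure in the header can be cross-checked from the other header values. The following is a minimal sketch, assuming that a "cell update" is one dynamic-programming cell (one query residue compared against one database residue); the function name is hypothetical and not part of GenCore.

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int, seconds: float) -> float:
    """Return DP cells filled per second, in millions.

    Assumption: total cells = query length x total database residues.
    """
    return query_len * db_residues / seconds / 1e6


# Values taken from the header: 12-residue query, 386760381 residues, 163 seconds.
rate = million_cell_updates_per_sec(query_len=12, db_residues=386_760_381, seconds=163)
print(f"{rate:.3f} Million cell updates/sec")
```

Under that assumption the computed rate agrees with the reported 28.473 Million cell updates/sec.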



Database :      A_Geneseq_16Dec04:*
                geneseqp1980s:*
                geneseqp1990s:*
                geneseqp2000s:*
                geneseqp2001s:*
                geneseqp2002s:*
                geneseqp2003as:*
                geneseqp2003bs:*
                geneseqp2004s:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                  %
Result          Query
   No.  Score   Match  Length  DB  ID        Description
     1     44    62.9     121   4  AAB95737  Aab95737 Human pro
     2     44    62.9     296   8  ADJ50939  Adj50939 Human nov
     3     44    62.9     307   8  ADJ50941  Adj50941 Human nov
     4     44    62.9     329   8  ADM87220  Adm87220 Human pro
     5     44    62.9     351   5  AAE18955  Aae18955 Human eel
     6     44    62.9     351   8  ADN01129  Adn01129 Human eel
     7     42    60.0     262   3  AAG43457  Aag43457 Arabidops
     8     42    60.0     262   3  AAG23843  Aag23843 Arabidops
     9     42    60.0     312   6  ABU23840  Abu23840 Protein e
    10     42    60.0     344   7  ADC87149  Adc87149 Human [illegible]
    11     42    60.0     351   3  AAG23842  Aag23842 Arabidops
    12     42    60.0     351   3  AAG43456  Aag43456 Arabidops
    13     42    60.0     381   3  AAG43455  Aag43455 Arabidops
    14     42    60.0     381   3  AAG23841  Aag23841 Arabidops
    15     41    58.6      90 [?]  ABU61687  Abu61687 [illegible]
    16     41    58.6      90   7  ADB66818  Adb66818 Potassium
    17     41    58.6     348   1  AAP94682  Aap94682 Human man
    18     41    58.6     648   8  ADO59397  Ado59397 Human fan
    19     41    58.6     856   7  ABW01409  Abw01409 Human HFA
    20     41    58.6     962   3  AAY49944  Aay49944 [illegible]
    21     41    58.6     988   4  AAB31714  Aab31714 Human [illegible]
    22     41    58.6     988   4  AAE01043  Aae01043 Human eth
    23     41    58.6     988   5  ABG70999  Abg70999 Human [illegible]
    24     41    58.6     988   5  ABB76165  Abb76165 Human [illegible]
    25     41    58.6     988   5  AAO14214  Aao14214 Human [illegible]
    26     41    58.6     988   7  ADC77655  Adc77655 Human [illegible]
    27     41    58.6     988   7  ABW01364  Abw01364 Human [illegible]
    28     41    58.6     988   8  ADO59400  Ado59400 Human oan
    29     41    58.6     989   3  AAY49945  Aay49945 Human [illegible]
    30     41    58.6     989   7  ADC77667  Adc77667 Human 332
    31     41    58.6     989   8  ADR44914  Adr44914 Polypepti
    32     41    58.6    1174   4  ABB65304  Abb65304 Drosophil
    33     40    57.1      59   8  ABO56119  Abo56119 Human gen
    34     40    57.1      84   4  AAU63085  Aau63085 Propionib
    35     40    57.1      84 [?]  ABM59604  Abm59604 Propionib
    36     40    57.1     699   8  ABO84631  Abo84631 Mouse can
    37     40    57.1    1178   8  ADP74025  Adp74025 Murine CD
    38     39    55.7      85   4  AAU21335  Aau21335 Human nov
    39     39    55.7     103   4  AAU59241  Aau59241 Propionib
    40     39    55.7     103   6  ABM55760  Abm55760 Propionib
    41     39    55.7     166   7  ABO75576  Abo75576 Pseudomon
    42     39    55.7     194   3  AAG61086  Aag61086 Arabidops
    43     39    55.7     194   3  AAG56099  Aag56099 Arabidops
    44     39    55.7     198   3  AAG56098  Aag56098 Arabidops
    45     39    55.7     198   3  AAG61085  Aag61085 Arabidops
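Each entry in the summary listing carries seven fields: result number, score, query-match percentage, length, database index, ID, and a truncated description. The following is a minimal sketch of reading one such entry once its fields are on a single whitespace-separated line; the `SummaryRow` type and `parse_summary_row` function are hypothetical helpers, not part of GenCore.

```python
from typing import NamedTuple


class SummaryRow(NamedTuple):
    result_no: int
    score: float
    query_match_pct: float
    length: int
    db: int
    seq_id: str
    description: str


def parse_summary_row(line: str) -> SummaryRow:
    """Split one summary line into its seven fields.

    The description may contain spaces, so only the first six
    fields are split on whitespace; the remainder is kept whole.
    """
    parts = line.split(None, 6)
    if len(parts) < 7:
        raise ValueError(f"expected 7 fields, got {len(parts)}: {line!r}")
    no, score, pct, length, db, seq_id, desc = parts
    return SummaryRow(int(no), float(score), float(pct), int(length),
                      int(db), seq_id, desc.strip())


# First entry of the summary listing.
row = parse_summary_row("1 44 62.9 121 4 AAB95737 Aab95737 Human pro")
print(row.seq_id, row.score, row.description)
```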



ALIGNMENTS 



RESULT 1
AAB95737
ID   AAB95737 standard; protein; 121 AA.
XX
AC   AAB95737;
XX
DT   26-JUN-2001  (first entry)
XX
DE   Human protein sequence SEQ ID NO: 18627.
XX
KW   Human; primer; detection; diagnosis; antisense therapy; gene therapy.
XX
OS   Homo sapiens.
XX
PN   EP1074617-A2.
XX
PD   07-FEB-2001.
XX
PF   28-JUL-2000; 2000EP-00116126.
XX
PR   29-JUL-1999;   99JP-00248036.
PR   27-AUG-1999;   99JP-00300253.
PR   11-JAN-2000; 2000JP-00118776.
PR   02-MAY-2000; 2000JP-00183767.
PR   09-JUN-2000; 2000JP-00241899.
XX
PA   (HELI-) HELIX RES INST.
XX
PI   Ota T, Isogai T, Nishikawa T, Hayashi K, Saito K, Yamamoto J;
PI   Ishii S, Sugiyama T, Wakamatsu A, Nagai K, Otsuki T;
XX
DR   WPI; 2001-318749/34.
XX
PT   Primer sets for synthesizing polynucleotides, particularly the 5602 full-
PT   length cDNAs defined in the specification, and for the detection and/or
PT   diagnosis of the abnormality of the proteins encoded by the full-length
PT   cDNAs.
XX
PS   Claim 8; SEQ ID NO 18627; 2537pp + Sequence Listing; English.
XX
CC   The present invention describes primer sets for synthesising 5602 full-
CC   length cDNAs defined in the specification. Where a primer set comprises:
CC   (a) an oligo-dT primer and an oligonucleotide complementary to the
CC   complementary strand of a polynucleotide which comprises one of the 5602
CC   nucleotide sequences defined in the specification, where the
CC   oligonucleotide comprises at least 15 nucleotides; or (b) a combination
CC   of an oligonucleotide comprising a sequence complementary to the
CC   complementary strand of a polynucleotide which comprises a 5'-end
CC   sequence and an oligonucleotide comprising a sequence complementary to a
CC   polynucleotide which comprises a 3'-end sequence, where the
CC   oligonucleotide comprises at least 15 nucleotides and the combination of
CC   the 5'-end sequence/3'-end sequence is selected from those defined in the
CC   specification. The primer sets can be used in antisense therapy and in
CC   gene therapy. The primers are useful for synthesising polynucleotides,
CC   particularly full-length cDNAs. The primers are also useful for the
CC   detection and/or diagnosis of the abnormality of the proteins encoded by
CC   the full-length cDNAs. The primers allow obtaining of the full-length
CC   cDNAs easily without any specialised methods. AAH03166 to AAH13628 and
CC   AAH13633 to AAH18742 represent human cDNA sequences; AAB92446 to AAB95893
CC   represent human amino acid sequences; and AAH13629 to AAH13632 represent
CC   oligonucleotides, all of which are used in the exemplification of the
CC   present invention
XX
SQ   Sequence 121 AA;

  Query Match             62.9%;  Score 44;  DB 4;  Length 121;
  Best Local Similarity   66.7%;  Pred. No. 28;
  Matches    8;  Conservative    1;  Mismatches    3;  Indels    0;  Gaps    0;

Qy          1 MCPTDMPASLCM 12
              |||  :|||| |
Db         76 MCPVPVPASLSM 87
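The alignment statistics reported for this hit can be recomputed from the two aligned strings. The sketch below assumes that "Conservative" counts non-identical pairs with a positive BLOSUM62 score and that "Best Local Similarity" is identical pairs divided by aligned columns; both are assumptions that happen to reproduce the reported figures, not documented GenCore behavior. Only the BLOSUM62 entries needed for this alignment are included, and the names are hypothetical.

```python
# Subset of standard BLOSUM62 values covering the pairs in this alignment only.
BLOSUM62_SUBSET = {
    ("M", "M"): 5, ("C", "C"): 9, ("P", "P"): 7, ("A", "A"): 4,
    ("S", "S"): 4, ("L", "L"): 4,
    ("T", "V"): 0, ("D", "P"): -1, ("M", "V"): 1, ("C", "S"): -1,
}


def alignment_stats(query: str, subject: str, matrix: dict) -> dict:
    """Tally an ungapped alignment column by column."""
    matches = conservative = mismatches = score = 0
    for q, s in zip(query, subject):
        value = matrix[(q, s)]
        score += value
        if q == s:
            matches += 1
        elif value > 0:          # assumed definition of "conservative"
            conservative += 1
        else:
            mismatches += 1
    similarity = round(100 * matches / len(query), 1)
    return {"score": score, "matches": matches, "conservative": conservative,
            "mismatches": mismatches, "best_local_similarity": similarity}


stats = alignment_stats("MCPTDMPASLCM", "MCPVPVPASLSM", BLOSUM62_SUBSET)
print(stats)
```

With these inputs the tallies agree with the report: score 44, 8 matches, 1 conservative, 3 mismatches, 66.7% similarity.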



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:         July 28, 2005, 10:55:53 ; Search time 41 Seconds
                (without alignments)
                21.849 Million cell updates/sec

Title:          US-10-618-336A-6
Perfect score:  70
Sequence:       1 MCPTDMPASLCM 12

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       513545 seqs, 74649064 residues

Total number of hits satisfying chosen parameters:      513545

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :      Issued_Patents_AA:*
                1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
                2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
                3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
                4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
                5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
                6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                  %
Result          Query
   No.  Score   Match  Length  DB  ID                    Description
     1     41    58.6      90   4  US-09-358-383C-31     Sequence 31, Appl
     2     41    58.6     960   4  US-09-694-777A-21     Sequence 21, Appl
     3     41    58.6     962   4  US-09-614-480-9       Sequence 9, Appli
     4     41    58.6     962   4  US-09-694-777A-3      Sequence 3, Appli
     5     41    58.6     962   4  US-09-694-777A-24     Sequence 24, Appl
     6     41    58.6     962   4  US-10-422-075-9       Sequence 9, Appli
     7     41    58.6     987   4  US-09-694-777A-22     Sequence 22, Appl
     8     41    58.6     988   4  US-09-614-480-2       Sequence 2, Appli
     9     41    58.6     988   4  US-10-162-012-5       Sequence 5, Appli
    10     41    58.6     988   4  US-10-162-012-12      Sequence 12, Appl
    11     41    58.6     988   4  US-10-422-075-2       Sequence 2, Appli
    12     41    58.6     989   4  US-09-694-777A-4      Sequence 4, Appli
    13     41    58.6     989   4  US-09-694-777A-23     Sequence 23, Appl
    14     40    57.1      96   4  US-09-403-343B-24     Sequence 24, Appl
    15     39    55.7     166   4  US-09-252-991A-24322  Sequence 24322, A
    16     38    54.3     103   4  US-09-252-991A-28978  Sequence 28978, A
    17     38    54.3    2500   2  US-08-801-263A-2      Sequence 2, Appli
    18     38    54.3    2500   3  US-09-102-248-2       Sequence 2, Appli
    19     38    54.3    2500   4  US-09-367-764-2       Sequence 2, Appli
    20     38    54.3    2517   2  US-08-801-263A-5      Sequence 5, Appli
    21     38    54.3    2517   3  US-09-102-248-5       Sequence 5, Appli
    22     38    54.3    2517   4  US-09-367-764-5       Sequence 5, Appli
    23     37    52.9     250   4  US-09-270-767-61069   Sequence 61069, A
    24     37    52.9     433   4  US-09-270-767-45556   Sequence 45556, A
    25     37    52.9     534   4  US-09-270-767-46023   Sequence 46023, A
    26     37    52.9    3830   4  US-09-693-205A-4      Sequence 4, Appli
    27     36    51.4      42   4  US-09-270-767-31926   Sequence 31926, A
    28     36    51.4      42   4  US-09-270-767-47141   Sequence 47141, A
    29     36    51.4     200   4  US-09-252-991A-18742  Sequence 18742, A
    30     36    51.4     256   4  US-09-949-016-7576    Sequence 7576, Ap
    31     36    51.4     256   4  US-09-949-016-7577    Sequence 7577, Ap
    32     36    51.4     280   4  US-09-949-016-9249    Sequence 9249, Ap
    33     36    51.4     280   4  US-09-949-016-9250    Sequence 9250, Ap
    34     36    51.4     306   4  US-09-270-767-41758   Sequence 41758, A
    35     36    51.4     330   3  US-09-188-930-125     Sequence 125, App
    36     36    51.4     330   4  US-09-312-283C-125    Sequence 125, App
    37     36    51.4     404   3  US-09-191-608-18      Sequence 18, Appl
    38     36    51.4     431   4  US-09-270-767-57907   Sequence 57907, A
    39     36    51.4     443   4  US-09-266-965-131     Sequence 131, App
    40     36    51.4     447   3  US-09-191-608-19      Sequence 19, Appl
    41     36    51.4     471   3  US-09-191-608-17      Sequence 17, Appl
    42     36    51.4     497   3  US-09-191-608-20      Sequence 20, Appl
    43     36    51.4     555   4  US-09-270-767-42597   Sequence 42597, A
    44     36    51.4     617   3  US-09-188-930-303     Sequence 303, App
    45     36    51.4     617   4  US-09-312-283C-303    Sequence 303, App



ALIGNMENTS 



RESULT 1
US-09-358-383C-31
; Sequence 31, Application US/09358383C
; Patent No. 6518398
; GENERAL INFORMATION:
;  APPLICANT: Curtis, Rory A.J.
;  TITLE OF INVENTION: NOVEL POTASSIUM CHANNEL MOLECULES AND USES THEREFOR
;  FILE REFERENCE: MNI-055CP
;  CURRENT APPLICATION NUMBER: US/09/358,383C
;  CURRENT FILING DATE: 1999-07-21
;  PRIOR APPLICATION NUMBER: USSN 09/119,855
;  PRIOR FILING DATE: 1998-07-21
;  NUMBER OF SEQ ID NOS: 36
;  SOFTWARE: PatentIn Ver. 2.0
; SEQ ID NO 31
;   LENGTH: 90
;   TYPE: PRT
;   ORGANISM: Artificial Sequence
;   FEATURE:
;   OTHER INFORMATION: Description of Artificial Sequence: domain
US-09-358-383C-31

  Query Match             58.6%;  Score 41;  DB 4;  Length 90;
  Best Local Similarity   54.5%;  Pred. No. 15;
  Matches    6;  Conservative    2;  Mismatches    3;  Indels    0;  Gaps    0;

Qy          2 CPTDMPASLCM 12
              || || | :|:
Db         57 CPKDMKADICV 67



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on:         July 28, 2005, 11:04:24 ; Search time 155 Seconds
                (without alignments)
                30.163 Million cell updates/sec

Title:          US-10-618-336A-6
Perfect score:  70
Sequence:       1 MCPTDMPASLCM 12

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1745140 seqs, 389608008 residues

Total number of hits satisfying chosen parameters:      1745140

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      Published_Applications_AA:*
                1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep:*
                2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:*
                3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:*
                4: /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep:*
                5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:*
                6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:*
                7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:*
                8: /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep:*
                9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:*
                10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep:*
                11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep:*
                12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:*
                13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep:*
                14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep:*
                15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep:*
                16: /cgn2_6/ptodata/2/pubpaa/US10D_PUBCOMB.pep:*
                17: /cgn2_6/ptodata/2/pubpaa/US10E_PUBCOMB.pep:*
                18: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:*
                19: /cgn2_6/ptodata/2/pubpaa/US11A_PUBCOMB.pep:*
                20: /cgn2_6/ptodata/2/pubpaa/US11_NEW_PUB.pep:*
                21: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:*
                22: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                  %
Result          Query
   No.  Score   Match  Length  DB  ID                    Description
     1     44    62.9      47  15  US-10-424-599-193456  Sequence 193456,
[Rows 2-9 are only partly legible: rows 2-5 show score 44 (62.9%) and rows 6-9 show score 42 (60.0%); their lengths, IDs and descriptions cannot be reliably recovered.]

ALIGNMENTS



RESULT 1
US-10-424-599-193456
; Sequence 193456, Application US/10424599
; Publication No. US20040031072A1
; GENERAL INFORMATION:
;  APPLICANT: La Rosa Thomas J
;  APPLICANT: Kovalic David K
;  APPLICANT: Zhou Yihua
;  APPLICANT: Cao Yongwei
;  TITLE OF INVENTION: Soy Nucleic Acid Molecules and Other Molecules Associated With
;  TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement
;  FILE REFERENCE: 38-21(53223)B
;  CURRENT APPLICATION NUMBER: US/10/424,599
;  CURRENT FILING DATE: 2003-04-28
;  NUMBER OF SEQ ID NOS: 285684
; SEQ ID NO 193456
;   LENGTH: 47
;   TYPE: PRT
;   ORGANISM: Glycine max
;   FEATURE:
;   OTHER INFORMATION: Clone ID: PAT_MRT3847_16715C.1.pep
US-10-424-599-193456

  Query Match             62.9%;  Score 44;  DB 15;  Length 47;
  Best Local Similarity   50.0%;  Pred. No. 13;
  Matches    6;  Conservative    4;  Mismatches    2;  Indels    0;  Gaps    0;

Qy          1 MCPTDMPASLCM 12
              :| |:||: ||:
Db          9 LCQTNMPSKLCL 20
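The line printed between the Qy and Db rows marks each aligned column. A minimal sketch of how such a line can be derived follows, assuming `|` marks identical residues, `:` marks non-identical pairs with a positive BLOSUM62 score, and a space marks everything else. This convention is inferred from the reported counts, not from GenCore documentation; the function name is hypothetical and only the BLOSUM62 values needed here are listed.

```python
# Standard BLOSUM62 values for the non-identical pairs in this alignment only.
PAIR_SCORES = {
    ("M", "L"): 2, ("P", "Q"): -1, ("D", "N"): 1,
    ("A", "S"): 1, ("S", "K"): 0,
}


def match_line(query: str, subject: str, pair_scores: dict) -> str:
    """Build the per-column marker line for an ungapped alignment."""
    marks = []
    for q, s in zip(query, subject):
        if q == s:
            marks.append("|")            # identical residues
        elif pair_scores[(q, s)] > 0:
            marks.append(":")            # assumed "conservative" substitution
        else:
            marks.append(" ")            # mismatch
    return "".join(marks)


line = match_line("MCPTDMPASLCM", "LCQTNMPSKLCL", PAIR_SCORES)
print("Qy " + "MCPTDMPASLCM")
print("   " + line)
print("Db " + "LCQTNMPSKLCL")
```

Counting the markers gives 6 `|`, 4 `:` and 2 spaces, matching the reported Matches 6, Conservative 4, Mismatches 2.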

GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: July 28, 2005, 10:55:03 ; Search time 38 Seconds 

(without alignments) 
30.384 Million cell updates/sec 

Title: US-10-618-336A-6 
Perfect score: 70 

Sequence: 1 MCPTDMPASLCM 12 



Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters:      283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      PIR_79:*
                1: pir1:*
                2: pir2:*
                3: pir3:*
                4: pir4:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
RESULT 1
T49799
related to TOM1 protein [imported] - Neurospora crassa
N;Alternate names: protein B11B22.10
C;Species: Neurospora crassa
C;Date: 02-Jun-2000 #sequence_revision 02-Jun-2000 #text_change 09-Jul-2004
C;Accession: T49799
R;Schulte, U.; Aign, V.; Hoheisel, J.; Brandt, P.; Fartmann, B.; Holland, R.; Nyakatura, G.; Mewes, H.W.; Mannhaupt, G.
submitted to the Protein Sequence Database, May 2000
A;Reference number: Z25022
A;Accession: T49799
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-3839 <SCH>
A;Cross-references: UNIPROT:Q9P4Z1; EMBL:AL356834; GSPDB:GN00116; NCSP:B11B22.10
A;Experimental source: BAC clone B11B22; strain OR74A
C;Genetics:
A;Gene: NCSP:B11B22.10
A;Map position: 6
A;Introns: 16/3; 2607/1; 2623/1; 2658/1; 2845/1; 2987/2; 3204/3; 3694/1; 3809/1

  Query Match             67.1%;  Score 47;  DB 2;  Length 3839;
  Best Local Similarity   70.0%;  Pred. No. 21;
  Matches    7;  Conservative    3;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 MCPTDMPASL 10
              |||:||||::
Db        381 MCPSDMPATM 390
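The "Query Match" percentage reported with each hit appears to be the hit's score expressed as a fraction of the perfect score of 70 given in the search header. The sketch below works under that assumption; the function name is hypothetical and the relationship is inferred from the numbers in this report, not taken from GenCore documentation.

```python
PERFECT_SCORE = 70  # "Perfect score" from the search header


def query_match_pct(score: int, perfect: int = PERFECT_SCORE) -> float:
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100 * score / perfect, 1)


# Scores that occur in the result listings of this report.
for score in (47, 44, 42, 41, 40, 39):
    print(f"Score {score}: Query Match {query_match_pct(score):.1f}%")
```

For the hit above, a score of 47 gives 67.1%, the same value shown on its Query Match line.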



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on:         July 28, 2005, 10:54:23 ; Search time 174 Seconds
                (without alignments)
                35.316 Million cell updates/sec

Title:          US-10-618-336A-6
Perfect score:  70
Sequence:       1 MCPTDMPASLCM 12

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       1612378 seqs, 512079187 residues

Total number of hits satisfying chosen parameters:      1612378

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      UniProt_03:*
                1: uniprot_sprot:*
                2: uniprot_trembl:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
TOM1_NEUCR
ID   TOM1_NEUCR     STANDARD;      PRT;  4065 AA.
AC   Q9P4Z1;
DT   25-OCT-2004 (Rel. 45, Created)
DT   25-OCT-2004 (Rel. 45, Last sequence update)
DT   25-OCT-2004 (Rel. 45, Last annotation update)
DE   E3 ubiquitin protein ligase TOM1-like protein (EC 6.3.2.-).
GN   ORFNames=B11B22.010, NCU08501.1;
OS   Neurospora crassa.
OC   Eukaryota; Fungi; Ascomycota; Pezizomycotina; Sordariomycetes;
OC   Sordariomycetidae; Sordariales; Sordariaceae; Neurospora.
OX   NCBI_TaxID=5141;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   STRAIN=74-OR23-1A / FGSC 987;
RX   MEDLINE=22542210; PubMed=12655011; DOI=10.1093/nar/gkg293;
RA   Mannhaupt G., Montrone C., Haase D., Mewes H.-W., Aign V.,
RA   Hoheisel J.D., Fartmann B., Nyakatura G., Kempken F., Maier J.,
RA   Schulte U.;
RT   "What's in the genome of a filamentous fungus? Analysis of the
RT   Neurospora genome sequence.";
RL   Nucleic Acids Res. 31:1944-1954(2003).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   STRAIN=74-OR23-1A / FGSC 987;
RX   PubMed=12712197; DOI=10.1038/nature01554;
RA   Galagan J.E., Calvo S.E., Borkovich K.A., Selker E.U., Read N.D.,
RA   Jaffe D., FitzHugh W., Ma L.-J., Smirnov S., Purcell S., Rehman B.,
RA   Elkins T., Engels R., Wang S., Nielsen C.B., Butler J., Endrizzi M.,
RA   Qui D., Ianakiev P., Bell-Pedersen D., Nelson M.A.,
RA   Werner-Washburne M., Selitrennikoff C.P., Kinsey J.A., Braun E.L.,
RA   Zelter A., Schulte U., Kothe G.O., Jedd G., Mewes H.-W., Staben C.,
RA   Marcotte E., Greenberg D., Roy A., Foley K., Naylor J.,
RA   Stange-Thomann N., Barrett R., Gnerre S., Kamal M., Kamvysselis M.,
RA   Mauceli E., Bielke C., Rudd S., Frishman D., Krystofova S.,
RA   Rasmussen C., Metzenberg R.L., Perkins D.D., Kroken S., Cogoni C.,
RA   Macino G., Catcheside D., Li W., Pratt R.J., Osmani S.A.,
RA   DeSouza C.P., Glass L., Orbach M.J., Berglund J.A., Voelker R.,
RA   Yarden O., Plamann M., Seiler S., Dunlap J., Radford A., Aramayo R.,
RA   Natvig D.O., Alex L.A., Mannhaupt G., Ebbole D.J., Freitag M.,
RA   Paulsen I., Sachs M.S., Lander E.S., Nusbaum C., Birren B.;
RT   "The genome sequence of the filamentous fungus Neurospora crassa.";
RL   Nature 422:859-868(2003).
CC   -!- FUNCTION: Probable ubiquitin ligase protein, which may be involved
CC       in mRNA export. E3 ubiquitin ligase protein mediate ubiquitination
CC       and subsequent proteasomal degradation of target proteins.
CC       Participates in mRNA export from the nucleus by regulating the
CC       transport of hnRNP proteins (By similarity).
CC   -!- PATHWAY: Ubiquitin conjugation; third step.
CC   -!- SUBCELLULAR LOCATION: Nuclear (By similarity).
CC   -!- SIMILARITY: Belongs to the TOM1/PTR1 family.
CC   -!- SIMILARITY: Contains 1 HECT-type E3 ubiquitin-protein ligase
CC       domain.
CC   --------------------------------------------------------------------------
CC   This SWISS-PROT entry is copyright. It is produced through a collaboration
CC   between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC   the European Bioinformatics Institute. There are no restrictions on its
CC   use by non-profit institutions as long as its content is in no way
CC   modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
DR   EMBL; AL356834; CAB92704.2; -.
DR   EMBL; AABX01000134; EAA34194.1; -.
DR   PIR; T49799; T49799.
DR   HSSP; Q05086; 1C4Z.
DR   InterPro; IPR010309; DUF908.
DR   InterPro; IPR010314; DUF913.
DR   InterPro; IPR000569; HECT_domain.
DR   Pfam; PF06012; DUF908; 1.
DR   Pfam; PF06025; DUF913; 1.
DR   Pfam; PF00632; HECT; 1.
DR   SMART; SM00119; HECTc; 1.
DR   PROSITE; PS50237; HECT; 1.
KW   Coiled coil; Ligase; mRNA transport; Nuclear protein; Transport;
KW   Ubl conjugation pathway.
FT   DOMAIN     3729   4065       HECT.
FT   DOMAIN     1263   1287       Coiled coil (Potential).
FT   DOMAIN     2491   2568       Coiled coil (Potential).
FT   DOMAIN     2839   2924       Coiled coil (Potential).
FT   DOMAIN     3320   3369       Coiled coil (Potential).
FT   BINDING    4032   4032       Ubiquitin (By similarity).
SQ   SEQUENCE   4065 AA;  452568 MW;  F74683CEC36F9350 CRC64;

  Query Match             67.1%;  Score 47;  DB 1;  Length 4065;
  Best Local Similarity   70.0%;  Pred. No. 65;
  Matches    7;  Conservative    3;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 MCPTDMPASL 10
              |||:||||::
Db        381 MCPSDMPATM 390



